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APPLICATION FOR UNITED STATES PATENT 
Inv ntor: Marcia K. Wolf 

This application is a continuation of patent application USSN 
08/788,145 08/788,145 filed January 24, 1997, which is a continua- 
tion-in-part of patent application USSN 243,482 filed May 13, 1994, 
which is now abandoned. 
Field of the Invention: 

This invention is related to a CS6 antigen for use in vaccines 
to protect from pathological effects of enterotoxigenic E. coli . 
Background of the Inventions; 

CS6 is a component of CFA/IV (colonization factor antigen IV) , 
one of three CFAs commonly found on enterotoxigenic Escherichia 
coli (ETEC) . A recent study showed CS6 on 31% of ETEC isolated 
from soldiers in the Middle East. Other CFAs and similar proteins 
found on the surface of ETEC function as adhesins to attach 
bacteria to intestinal epithelial cells. Attached -bacteria can 
then deliver their toxin (s) to the target cells. It has never been 
proved that CS6 is an adhesin for human tissue (Knutton, S., M. M. 
McConnell, B. Rowe, and A. S. McNeish, "Adhesion and Ultrastructur- 
al Properties of Human Enterotoxigenic Escherichia coli Producing 
Colonization Factor Antigens III and IV", Infect. Immun. 57:3364 — 
3371 (1989)), but a study in rabbits indicated CS6 is a coloniza- 
tion factor. 

The CS6 operon has much in common with fimbrial operons from 
E. coli, Salmonella, Yersinia, Klebsiella, Haemophilus, and 
Bordetella. All contain molecular chaperons and ushers and a 
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number of structural subunits. This area contains two sequences 
homologous to insertion sequences, but no complete insertion 
sequences . 

The low GC content (34%) and codon usage that is characteris- 
tic of E. coli genes that are expressed at low levels suggest the 
CS6 genes may have originated in another species. GC content of 
35-45% is characteristic of Gram positive bacteria sufch as 
Staphylococcus, Streptococcus, Bacillus, and Lactobacillus. Low GC 
content is common for virulence-associated genes of E. coli . 

CS6 is unusual because it is expressed on bacteria grown on a 
variety of media, unlike other CFA's from ETEC that are only 
expressed on bacteria grown on CFA agar. This unusual regulation 
is not peculiar to strain E877 5 because ETEC isolated in 1990 
expressed CS6 when grown on L agar. Temperature regulation of CS6 
expression is characteristic of other CFA's from ETEC and virulence 
genes in a variety of pathogenic bacteria. 

Although CS6 has never been visualized by negative staining, 
electron microscopy using anti-CS6 sera and colloidal gold 
indicated that it is present on the surface of ETEC. The apparent 
major protein associated with CS6 is approximately 16 kDa which is 
in the range of molecular weights typical for subunits for fimbriae 
and fibrillae. CS6 from ETEC strain E10703 of serotype 0167 :H5 has 
been cloned (Willshaw, et al., FEMS Microbio. Let . 49: 473-478 
(1988)). Only 3 kb of DNA was necessary for expression of CS6. 
That is in contrast to fimbriae that typically require approximate- 
ly 9 kb of DNA and include genes for subunits as well as proteins 
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I) • 

for transport of subunits and synthesis and assembly on the 
bacterial surface. 

Grewal teaches bacterial strains transformed with plasmids 
containing genes which encode CS6 . However, that reference does 
not teach use of plasmids under the controls of a lac promotor and 
a CS6 promotor. 

Brief Description of the Drawings: 

Figure 1 shows the restriction sites and the location of the 
pertinent genes that make up the CS6 operon. 

Figure 2 shows derivation of the clone containing the 
kanamycin resistance gene. 
Description of the Invention; 

It is the purpose of this invention to provide structural 
proteins which will act as antigens to stimulate protective 
antibodies against enterotoxigenic Escherichia coli . Particularly 
important are proteins having the antigenic properties of the 
proteins encoded by the css A and css B genes. Constructs may be 
prepared which encode either one or both of the proteins. However, 
both proteins would be needed to provide desirable protection. The 
CS6 operon includes four genes which we designate cssA, cssB, css C r 
and css D. css A and Css B encode the structural proteins of CS6. 
The CS6 operon has much in common with fimbrial operons from E. 
coli . Salmonella, Yersinia, Klebsiella, Haemophilus, and Bordetel- 
la. All contain molecular chaperons and ushers and a number of 
structural subunits. In a preferred embodiment , plasmids containing 
all four genes are transformed into attenuated bacteria, which are 
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then given by mouth to prevent morbidity arising from infection 
with E, coli . 

CS6 has two major subunits; protein sequencing data demon- 
strates that CS6A and CS6B are both present. The DNA sequence 
yields a mechanism for expression of similar amounts of the two 
proteins. The CS6 operon contains DNA immediately downstream of 
cssB which can form a stem-loop with a stem rich in G and C which 
commonly act as transcription terminators. Termination at this 
site yields a transcript with cssA and cssB such that CssA and CssB 
proteins would be translated in equal amounts. Fimbrial operons 
for Pap, K99, and K88 have stem loops immediately downstream of the 
genes for the major coding structural subunits. This has been 
offered as a mechanism for over expression of subunit genes 
relative to other genes in the operons. In the case of CS6, this 
would allow over expression of both CS6A and CS6B. 

The occurrence of two major structural proteins is unusual 
because fimbriae have a single major subunit and a number of 
minor subunits. CS3, which has been designated fibrillar rather 
than fimbrial, is an exception to this generality because it has 2 
subunits. CssD belongs to the family of molecular ushers located 
in the outer membrane that accept subunits from the chaperone and 
escorts them to the bacterial surface. Apparently the entire css D 
gene is not necessary for CS6 expression since CS6 was detected 
from clones carrying pDEP5 which only contains the N-terminal one- 
third of css D. Klemm and Christiansen found that mutations in the 
usher for Type 1 fimbriae reduced fimbriation but 10% of the 



4 



bacteria produced a few fimbriae ( Mol .Gen. Genet. 220:334-338). 

The CS6 proteins are produced in the transformed bacteria and 
are present on the exterior surface of the bacteria. These 
proteins give rise to immunological response in the host. For 
immunization, the bacteria may be given either dead or alive. When 
attenuated bacteria have been transformed, the bacteria can be 
given live in mildly basic carriers. .Economical and readily 
available carriers include carbonated water which may be flavored. 
The administration of the transformed bacteria in carbonated 
beverages is particularly useful, since the means necessary for 
administration is widely available. 

In a preferred embodiment, the products are produced under the 
control of a lac promotor from pUC19 . In the preferred embodiment, 
a vector pM346 containing a kanamycin resistant gene makes it 
possible to provide products which are appropriate for use in 
humans . 

The CS6 proteins may also be extracted from the supernatant of 
the culture containing the organisms which express the proteins. 
The proteins may then be administered orally. The proteins may be 
formulated by means known in the art, including microencapsulation, 
coated capsules and liposomes. The proteins may be lyophilized 
before formulation . 
MATERIALS AND METHODS 
Source of nucleic acid 

The genes for CS6 expression were from enterotoxigenic 
Escherichia coli (ETEC) strain E8775 tox" of serotype 025:H42 which 
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was a gift from Alejandro Cravioto. E8775 tox" is a derivative of 
E. coli strain E8775 which was originally isolated from Bangladesh. 
DH5a which was purchased from Bethesda Research Laboratories, Inc., 
Gaithersburg, MD. pUC19 was originally purchased from P-L Biochem. 

The antibiotic resistance gene encodes resistance to kanamycin 
and was purchased from Pharmacia, Uppsala, Sweden (Kan R GenBlock®) . 

CS6 expression is regulated from its native promoter. That is 
demonstrated by retention of control by growth temperature and is 
consistent with the DNA sequence determined from the clone. A 
contribution of the lac promoter from pUC19 is undefined. The 
contribution of increased copy number of the plasmid is probably 
substantial . 

The nucleotide sequence containing the coding region was 
determined to be as constructed containing the kanamycin resistance 
gene was as follows: 
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AAGCTTGTAA 


CCAGTTGATA 


AAAATATATC 


ACGCTGGGAA 


TGACGTGATG 


51 


TATATACGGA 


GCAGCTATGT 


CGGAACAGAT 


ATTTTCCTAT 


CGGTATGCGT 


101 


TGTGAGTAAG 


CGTAAAGCCA 


ATGCTGTCTG 


TAACTCCTGA 


TCCTTGCAGA 


151 


CTAAATTAGA 


GCTCCTTCTA 


AATTAGACGG 


ATGGATAAAC 


CTACAGACTG 


201 


GCGCTCTGGG 


TCTCGCCGGA 


TATTTTCTAA 


TGAATTTAAG 


CTTCATATGG 


251 


TTGAACTGGC 


TTCGAAACCA 


AATGCCAATG 


TCGCACAACT 


GGCTCGGGAA 


301 


CATGGCGTTG 


ATAACAACCT 


GATTTTTAAA 


TAGCTACGCC 


TCTGGCAAAG 


351 


AGAAGGACGT 


ATTTCTCGTA 


GAATGCCTCC 


AACTATTGTA 


GGCCCTACAG 


401 


TACCACTGAG 


GTAGCCTGAA 


TTTAAAGCCG 


AAGCGGTCAG 


AACTGTTCTT 


451 


GGTGTGAACG 


TAGCACTCAC 


CAATAAAAGC 


ATCAATACGG 


TGCTCTGTTG 


501 


ACACATTACG 


AATGTTATGT 


ATACAATAAA 


AATGATTATA 


GCAATATTAA 
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551 TGGTGTTATA TGAAGAAAAC 

601 CGGCAGCCAT GCCAGAACAG 

651 CGACTATTTC AAAAAGTTTT 

701 TTTGGTGAAA ATGTTGGAAA 

751 AACTGTTCCT GAAAATGTAT 

801 AAGATTATGG GTTAGGACGA 

851 ATAATCTACC AGATTGTTGA 

901 TGGTGCAGAG GTTACACCTA 

951 ATACTAGCGG GGAAAAAAAA 

1001 ATGGTTGGTT ACTATGTAAA 

1051 AAAATTATTT CGGCTATTGC 

1101 TGCAGGAAAC TGGCAATATA 

1151 AAAATTTTAT TCCAGATATT 

1201 TACGATTCGG ACCCGAAACT 

1251 GATCCCTGCA GGTGTAAGCG 

1301 CATCTTCTGG ACAGCAGATC 

1351 CAAAATATGA ATTATTATAT 

1401 GGCAGGACAA AAAGGATCCT 

1451 TCTCAGCAAT TTATACTGGT 

■1501 GGTACTTATG CAGGAAATTT 

1551 GGCCGCATTA TTGCGGCCAT 

1601 CAAAGTTAAT TATACTATTG 

1651 GGAAATAATT TTGAAATAAA 

1701 ACCATCAGTT CAAATATCAA 

1751 GCAATGTATG GGATGAAAGC 

1801 CCACCGATTT TTAAAATGGA 



AATTGGTTTA ATTCTAATTC TTGCTTCATT 
AAATAGCGAC TAAAAACTTC CCAGTATCAA 
TTTGCACCTG AACCACGAAT ACAGCCTTCT 
GGAAGGAGCT TTATTATTTA GTGTGAACTT 
CCCAGGTAAC GGTCTACCCT GTTTATGATG 
CTAGTAAATA CCGCTGATGC TTCCCAATCA 
TGAGAAAGGG AAAAAAATGT TAAAAGATCA 
ATCAACAAAT AACTTTTAAA GCGCTGAATT 
ATATCTCCTG GAATATATAA CGATCAGGTT 
CTAAATACTG GAAGTATGAT TATGTTGAAA 
ATTAATTGCA GGAACTTCCG GAGTGGTAAA 
AATCTCTGGA TGTAAATGTA AATATTGAGC 
GATTCCGCTG TTCGTATAAT ACCTGTTAAT 
GGATTCACAG TTATATACGG TTGAGATGAC 
CAGTTAAAAT CGCACCAACA GATAGTCTGA 
GGAAAGCTGG TTAATGTAAA CAATCCAGAT 
CAGAAAGGAT TCTGGCGCTG GTAACTTTAT 
TTCCTGTCAA AGAGAATACG TCATACACAT 
GGCGAATACC CTAATAGCGG ATATTCGTCT 
GACTGTATCA TTTTACAGCA ATTAAAAAAA 
TGACGATACT GCTAGGCAAA AATATGAAAT 
ACGTTAGTGC CATTTTCATC TTTTTCAACA 
TAAGACACGA GTAATTTACT CTGACAGCAC 
ATAATAAAGC ATATCCTTTA ATTATTCAAA 
AATAATAAAA ATCATGACTT TATAGCAACA 
AAGTGAAAGT CGGAATATAA TAAAAATAAT 
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1851 TAAAACAACT ATTAATTTGC CGGACTCTCA GGAAAGTATG AGATGGTTAT 

1901 GTATTGAATC AATGCCACCA ATAGAAAAAA GTACTAAAAT AAACAGAAAA 

1951 GAAGGAAGGA CAGACAGTAT TAATATCAGC ATTCGGGGGT GCATTAAACT 

2001 GATATATCGA CCTGCCAGTG TTCCGTCTCC TGTTTTTAAT AATATAGTAG 

2051 AAAAATTAAA ATGGCATAAA AATGGAAAGT ATCTTGTATT AAAAAATAAT 

2101 ^ACACCCTATT ACATTAGCTT TTCTGAGGTT TTTTTTGATT CAGATAAAGT 

2151 AAACAATGCA AAAGATATTT TATATGTAAA ACCATACTCA GAGAAGAAAA 

2201 TAGATATCAG CAACAGAATA ATAAAAAAAA TCAAATGGGC TATGATTGAT 

2251 GATGCTGGCG CAAAAACAAA ACTTTATGAA TCAATTTTAT AAAAAATCTC 

2301 ATTACAGTAT ACAAAAACAT CAGATTACAG GCTTGCTTTT TTTGCTATTT 

2 351 ATATATCCTT TCTCAACCTC ATATGGAAAT GAACAATTTA GTTTTGACTC 

2 4 03, ACGATTCCTA CCATCAGGTT ATAATTACTC TTTAAATAGT AACTTACCTC 

2451 CTGAAGGTGA GTATCTGGTT GATATTTATA TTAACAAAAT AAAAAAGGAG 

2501 TCCGCGATTA TTCCTTTTTA TATAAAAGGA AATAAACTTG TACCATGTTT 

2551 ATCAAAAGAA AAAATTTCAT CTTTGGGTAT CAACATTAAT AATAACGACA 

2601 ACACAGAGTG TGTAGAAACA AGTAAGGCAG GTATTAGTAA TATCAGCTTT 

2651 GAGTTTAGCT CTCTTCGTTT GTTTATTGCT GTACCGAAAA ATCTTCTGTC 

2701 TGAGATTGAT AAAATATCAT CAAAGGATAT AGATAACGGG ATTCATGCTT 

2751 TATTTTTTAA TTATCAAGTA AATACAAGGC TAGCCAATAA TAAAAATCGT 

2801 TATGATTACA TTTCTGTTTC ACCAAATATA AATTATTTTT CATGGCGGTT 

2851 GCGTAATCTT TTTGAATTTA ACCAAAACAA CGATGAAAAA ACATGGGAAA 

2901 GAAACTACAC TTATCTAGAA AAAAGTTTTT ATGATAAAAA GCTAAACTTA 

2951 GTCGTTGGTG AAAGTTATAC GAATTCAAAT GTTTATAATA ACTACTCTTT 

3001 TACTGGTATT TCAGTTTCTA CAGATACAGA TATGTATACG CCAAGTGAAA 

3051 TCGATTATAC ACCAGAAATT CATGGAGTGG CTGATTCAGA CTCTCAGATT 

3101 ATTGTCAGGC AAGGCAACAC CATTATCATT AATGAAAGTG TTCCAGCCGG 
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3151 ACCGTTCTCA TTTCCAATAA 

3201 ATGTGGAGAT AACAGATATT 

3251 AATTCCTCTC TTCCTGTTAT 

3301 TATATCTGGG AAATTAACAA 

3351 CTCAAGGTGA TATTAACTAC 

3401 GGATATCAGT TTAGTAAAAA 

3451 TGATCTGGGA TTTTCTGGAG 

3501 TTAAGAATAA AAATGGATAT 

3551 AGACCATTCA ATGCCGGGGT 

3601 GTATGTGGAA CTTTCCGACA 

3651 TTAAAAATAG TTTTTCTTTA 

37 01 AATTTCTCAC TTGATTATAA 

3751 TAGTAACTCA ATGT CGATTC 

3801 TTACAACAAA TTGTTCTTTA 

3851 AAAAGATTTA GTATTAATAT 

3901 CATATCTTCA AACTATTCAT 

3951 GTTCTGTAGG CTTAAACGGT 

4001 AACATTCAGC AGAACAGAAC 

4051 CAGTTACATA GCAACCAGCT 

4101 CATATTCAGG ATCGAACAAG 

4151 GGTATTGTTT TGCATAGCGA 

4201 TACCTCTGCT CTTGTCCGTA 

4251 ACACTCCTGG TGTTTATACT 

4301 CTTCAGCCGT TCAAAAAAAA 

43 51 AGACGGTATT ACTCTTGCTA 

4401 GAGCTATTGT TAAAGTAAAA 



CCAATCTCAT GTATACTGGG GGGCAACTTA 
TATGGAAATA AAAAACAATA TACTGTCAAT 
GAGAAAAGCG GGACTAATGG TATATAATTT 
AAAAAAATAG TGAGGATGGT GATTTTTTTA 
GGTACTCACT ATAACAGCAC ACTATTCGGT 
TTATTTTAAC TTATCTACTG GTATAGGCAC 
CATGGCTACT ACACGTTAGC AGAAGTAATT 
AATATTAATC TACAACAAAA CACTCAGTTA 
TAATTTCGAT TACGCATACA GAAAAAAAAG 
TTGGCTGGCA TGGTAATTTA TATAATCAAC 
TCCTTGTCAA AATCATTGAA TAAATACGGA 
CAAAATGAAA TACTGGGATA ATGCGTATGA 
GTTATTTTTT TAAATTCATG CGAGCAATGA 
AATAAATATC AATCTTATGA AAAAAAAGAT 
ATCATTGCCT TTAACCAAAG ATTACGGGCA 
TTTCCAATGC AAATACAGGA ACGGCAACCA 
AGTTTTTTTA ATGACGCAAG ATTAAACTGG 
GACCCGTAAC AATGGATATA CTGATAATAC 
ATGCCTCTCC CTATGGCGTT TTTACTGGTT 
TATTCAAGCC AGTTTTATTC TGCATCGGGA 
TGGCGTAGCT TTTACTCAAA AAGCCGGAGA 
TTGATAATAT TTCTGATATA AAAATTGGTA 
GGGTATAATG GTTTTGCTTT AATTCCTCAT 
CACCATTTTA ATTAATGATA AAGGAATTCC 
ATATAAAAAA ACAAGTTATC CCATCACGAG 
TTTGATGCTA AAAAAGGCAA TGACATTTTG 
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4451 TTTAAGCTTA CAACTAAAGA TGGAAAAACG CCCCCATTAG GAGCTATAGC 

4501 CCATGAAAAA AATGGAAAAC AGATTAATAC GGGTATCGTT GACGATGATG 

4551 GTATGCTTTA TATGTCTGGA TTATCAGGGA CAGGGATTAT TAATGTAACA 

4601 TGGAATGGAA AAGTCTGTTC ATTTCCTTTT TCAGAAAAAG ATATATCTAG 

4651 CAAACAATTA TCTGTTGTAA ATAAACAATG TTAGGTAGTG CATCCAATTA 

4701 GTAGAACATG TGTTTTTCGA TAAACGCTCC GATCTCTTTT TCGTGGATCT 

4751 CAACTGAGCG TGAGAAGCAG ATTGTTTTAC GAGCCAACCG CTTAATGCGG 

4801 GTGCGTAGCG TCAGATTATT ACGCTCAATG CGTTGGGTGA ATATTTTGCC 

4851 GGTCAGATGC TTATTCTTCG GTACC Sequence ID No, 1 

B. Cell expression clone : 

E. coli HB101 was purchased from the American Type Culture 
Collection, Rockville, Maryland. It is ATCC #33694 and batch #91- 
1. (Escherichia coli ATCC 33694) 

Preceptrol [Reg TM] culture. D. Ish-Horowicz and J.F. Burke HB101 

< H. Boyer. Genotype: F- leuB6 proA2 recA13 thi-1 ara-14 

lacYlgalK2 xyl-5 mtl-1 rpsL20 supE44 hsdS20 (r- B m- B at least 
thi-hsd from Escherichia coli B) . Produces isoprene (Curr. 
Microbiol. 30:97-103, 1995). J. Mol. Biol. 41: 459-472, 1969; 
Methods Enzymol. 68: 245-267, 1979.) Growth Conditions: Medium 
1065 37C. 

The plasmid containing the CS6 genes, the pUC19 origin of 
replication, and the gene for kanamycih resistance was transferred 
into E. coli HB101 by transformation. Transf ormants were selected 
by growth on L agar supplemented with 0.04% Xgal with 50 (igm per ml 
kanamycin sulfate and/or 50 nqm per ml ampicillin. 

One copy of the CS6 genes exists as an extrachromosomal 



plasmid of high (500-700) copy number. The CS6 genes are present 
on a plasmid, not integrated into the chromosome. The plasmid has 
been isolated from the strain and examined by agarose gel electro- 
phoresis. 

Plasmid DNA from E8775 , tox" was transferred to laboratory 
strain DH5a as a cointegrate with F' lac ts : :Tn5, a conjugative 
plasmid. Transfer of the F 1 lac ts : :Tn5 plasmid was selected by 
antibiotic resistance to kanamycin and CS6 expression was detected 
by Western blot using polyclonal antisera specific for CS6. 
Plasmids were isolated and a cointegrate was identified based on 
the large size. A spontaneous derivative in which the F' lac ts ::Tn5 
was removed was obtained and named M56. M56 contains a 61- 
megadalton plasmid from E8775 tox" and expresses CS6. Plasmid DNA 
from M56 was isolated, partially digested with restriction enzyme 
Hindlll, and ligated to pUC19 that had been digested with Hindlll. 
The ligation mixture was transformed into DH5a and plated onto L 
agar supplemented with ampicillin and X-gal. White (lac") colonies 
were picked to CFA plates supplemented with ampicillin and tested 
for CS6 expression. 

A stable clone named M233 with an insert of approximately 24 
kb into the cloning site of pUC19 was obtained. It was a spontane- 
ous deletion of a larger clone. Subclones were obtained by 
digestion with various enzymes and a subclone containing approxi- 
mately 5 kb from the Hindi 1 1 site to Kpnl site was found that 
expressed CS6. This clone was designated M285. Expression of CS6 
was verified by transferring plasmids into E. coli strain HB101 and 
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detecting CS6 expression. The cloned CS6 is expressed under the 
same conditions as CS6 from the native 61-megadalton plasmid: CS6 
was detected in extracts from bacteria grown at 37 °C on CFA agar, 
L agar or MacConkey agar. CS6 was not expressed on bacteria grown 
at 17 # C. 

Studies were performed to determine appropriate handling of 
strain M285 for reproducible expression of CS6. Growth temperature 
was found to be especially important. 

As indicated above, the protein sequence of the N-terminus of 
CS6 was determined from strains E8775 and from M233, the large 
clone derived from E8775. The 16 kDa proteins recovered from heat, 
saline extracts, and ammonium sulfate precipitation of M233 yielded 
two amino acids at each position (except cycle 12) indicating that 
two proteins were present. From the strength of the two signals, 
a probable primary sequence and a probable secondary sequence call 
was made for each of fifteen cycles. Quantitative analysis of the 
peak areas indicated that the molar ratio of the primary sequence 
(CS6A) to secondary sequence (CS6B) was approximately 3:1. The 
presence of the same two proteins was evident from strain E8775 
grown on CFA agar and on L agar. 

The DNA sequence of the DNA inserted into pUC19 in clone M285 
was determined. Wim Gaastra' s group in the Netherlands indepen- 
dently determined the DNA sequence of CS6 genes from ETEC strain 
E10703. The DNA sequences are available from Genebank accession 
numbers U04846 and U04844, respectively. A stretch of DNA of 4,219 
base pairs was 98% identical. The DNA sequences diverge abruptly 
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on both sides of the common region, defining the limits of the CS6 
genes. Four open reading frames were detected within the common 
area. These were designated cssA, css B, css C, and css D. 

The four open reading frames are preceded by consensus 
sequences for binding RNA polymerase and ribosomes. The first open 
reading frame, cssA was identified as the gene for the CS6 
structural protein designated as the primary protein based on the 
amino acid N-terminal sequence. The deduced molecular weight 
agrees with that previously determined from SDS PAGE. cssA 
includes a signal sequence that is typical for many exported 
proteins. Eleven of 13 6 residues differ between the deduced CssA 
proteins from E8775 and from E10704. 

css B begins 17 bases downstream from cssA. There is a typical 
signal sequence. css B was identified as the gene for the CS6 
structural protein designated as the secondary protein based on the 
amino acid N-terminal sequence. Five of 146 residues differ 
between the deduced CssA proteins from E8775 and from E10704. 

A region of dyad symmetry is present 6 bases downstream from 
CssB in both clones. The sequence is GGCCGCA TTAT TGCGGCC (Sequence 
#2) in E8775 ETEC and GGCCGCA TTATTGAT TGCGGCC (Sequence #3) in 
E10703. Underlined bases form the G-C rich stem. The calculated 
free energy value of these structures is -14.8 kcal. Such 
structures are often found in fimbrial operons after the genes 
encoding structural proteins. 

css C begins 48 bases downstream from css B. It has a typical 
signal sequence. The deduced proteins from both clones have 212 
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residues with 7 differences. A search of protein databases 
indicated CssC is homologous to chaperone proteins necessary for 
expression of a number of fimbriae* The structure of PapD, the 
chaperone protein for Pap fimbriae, has been solved by X-ray 
crystallography and regions important for conserving the structural 
domains have been identified. CssC conforms to this consensus. 

The cssD gene begins 14 bases upstream of the end of cssC. 
The protein from E8775 is truncated relative to the protein from 
E10703 and there are 28 differences between CssD from E8775 and 
E10703. The deduced protein from cssD is homologous to molecular 
ushers. Overall, CssD and the other proteins are only around 3 0% 
identical and around 50% similar, but the nine proteins have areas 
of high homology dispersed throughout, especially the first 410 
residues, and 4 cysteines (residues 91, 112, and two near the C- 
terminus) which are conserved in all ushers. 

A region of dyad symmetry is present 347 base pairs into the 
CssD gene in both clones. The calculated free energy value of 
these structures is -7.2 kcal. 

The plasmid from strain M285 was transformed into E. coli 
HB101 purchased from ATCC. The resulting strain was named M295. 
Expression of CS6 from M295 was achieved from small-scale fermenta- 
tion. For production for human use, it was desirable to add a gene 
for resistance to kanamycin as the selectable marker. To that end, 
a vector was constructed based on pUC19 but with a gene for 
kanamycin resistance in place of the gene for ampicillin resis- 
tance. The CS6 genes from the pUC19 clone were subcloned into the 
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new vector and transformed into E. coli HB101. 

Vector pM323 was constructed as follows. The kanamycin 
resistance gene was purchased from Pharmacia, Uppsala, Sweden 
(Kan R GenBlock®) and inserted into a cloning vector by Dr. David 
Lanar at WRAIR. DNA including the gene was amplified by PCR using 
the plasmid from Dr. Lanar as template and primers flanking the 
multiple cloning site. A product of the desired size (1,580 bp) 
was obtained, but with much template present. To increase the 
purity of the 1,580 fragment, a second PCR reaction was performed, 
this time with a small amount of the first PCR reaction as 
template. This product was confirmed by agarose gel electrophore- 
sis, then digested with restriction enzyme Hindi to remove 
unwanted restriction enzyme recognition sites. This product was 
ligated to pUC19 digested with Sspl. The ligation mix was 
transformed into E. coli DH5a and plated on L agar plates supple- 
mented with kanamycin and Xgal. Isolate M318 had the desired 
phenotype of resistance to kanamycin and ampicillin with lacZ' 
intact. The gene for ampicillin resistance was removed to make a 
smaller vector. This was achieved by designing and synthesizing 2 
oligonucleotides to amplify just the portion of pM318 with the gene 
for kanamycin resistance, the lacZ' gene carrying the multiple 
cloning site, and the origin of replication. PCR was performed, 
the product ligated then transformed into E. coli DH5a with 
selection on L agar plates supplemented with kanamycin and Xgal. 
Isolate M323 had the desired phenotype of resistance to kanamycin, 
sensitivity to ampicillin, and intact lac Z* . Restriction digest 
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patterns confirmed the plasmid was a derivative of pUC19 with the 
gene for kanamycin resistance in place of the gene for ampicillin 
resistance. 

The CS6 genes were cloned into vector pM32 3 from pM285. pM323 
and pM285 were digested with restriction enzyme SstI, ligated, and 
transformed into E. coli DH5a with selection on L agar plates 
supplemented with kanamycin and Xgal. Isolate M334 was determined 
to express CS6 . Plasmid analysis revealed M334 carried the CS6 
genes and 2 copies of the vector. An attempt was made to remove 
one copy of vector and at the same time move the clone into HB101, 
the desired host strain for fermentation. Isolate M340 was 
determined to express CS6 and retained 2 copies of the vector. An 
isolated colony of M340 was shown to produce high amounts of CS6 
and was saved as M3 46. 

In another embodiment lacking the kanamycin resistance gene, 
clones from an ETEC strain of serotype 025:H42 were derived from E. 
coli E8775 which was originally isolated from samples from 
Bangladesh. E. coli M56, which contains a 61-megadalton plasmid 
from E8775 Tox" and expresses CS6 has been described. The host for 
cloning was E. coli DH5a which was purchased from Bethesda Research 
Laboratories, Inc., Gaithersburg, MD. The host for plasmids used 
for production of heat, saline extracts was HB101 (EMB0 J. 
4:3887-3893 (1985) ) . 

Clones from E8775 were routinely grown in L broth. Antibiot- 
ics were added to L broth supplemented with agar as follows. 
Ampicillin was added, when appropriate, at 50 /ig/ml . Chlorampheni- 
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col was used at 30 /ig/ml. X-Gal (5-bromo-4-chloro-3-indolyl B-D- 
galactopyranoside, Sigma) was added at 0.004%, CFA plates were 
prepared as previously described ( Inf ect . Immun. 57:164-173 (1989) ) . 

Cloning CS6 from E8775 . The 61-megadalton plasmid from E. 
coli M56 was partially digested with Hin dlll and ligated to pUC19 
that had been digested with Hindlll. The ligation mixture was 
transformed into E. coli DH5a and plated onto L agar plates 
supplemented with ampicillin and X-gal. White (lac") colonies were 
picked to CFA plates supplemented with ampicillin and tested for 
CS6 expression using antisera as described below. Plasmids were 
purified as described ( Infect . Immun. 57:164-173 (1989)). Restric- 
tion enzymes were used according to the manufacturer's directions. 

Detection of CS6 Expression . CS6 expression by bacterial 
colonies was detected after transfer to nitrocellulose and 
treatment as described by Mierendorf ( Methods Enzymol. 152:458-469 
(1987)). Primary antisera was specific for CS6 and was raised in 
rabbits and absorbed as previously described ( Infect. Immun. 
57:164-173 (1989)), except that rabbits were inoculated intrave- 
nously with live bacteria suspended in normal saline. Secondary 
antibody was peroxidase-conjugated goat anti-rabbit IgG (Cappel 
Laboratories, Cochranville PA) and detection was by TMB Substrate 
(Kirkegaard & Perry Laboratories, Inc., Gaithersburg MD) . 

Positive identification of CS6 was by western blots of heat, 
saline extracts. Heat, saline extracts were prepared from bacteria 
grown on the indicated media as described ( Inf ect . Immun. 27:657-666 
(1980)). Proteins were separated on precast 16% Tricine sodium 
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dodecyl sulfate-polyacrylamide gels (SDS-PAGE, Novex Novel 
Experimental Technology, San Diego CA) and transferred to nitrocel- 
lulose. Blots were handled as described above for colony blots. 

Determination of N-terminal sequence . Heat, saline extracts 
were obtained from E8775 or clones of E8775 grown on L agar or CFA. 
Partial purification of CS6 was obtained by ammonium sulfate 
precipitation, with extracts sequentially precipitated at 20%, 40%, 
then 60% saturation. Samples at 40% and 60% saturation were 
dialyzed against deionized water and loaded onto precast 16% 
Tricine SDS-PAGE (Novex, San Diego, CA) . Proteins were blotted 
onto polyvinyl idene difluoride (PVDF) membranes (Westrans, 
Schleicher & Schuell, Keene, NH) , stained by Coomassie blue (Rapid 
Coomassie Stain, Diversified Biotech, Newton, MA) and bands of 
approximately 16 kDa were excised for automated gas-phase N- 
terminal sequencing analysis (Applied Biosystems Model 470A, Foster 
City, CA) . Data were analyzed using the Model 610A Data Analysis 
Program, Version 1.2.1 (Applied Biosystems, Inc, Foster City, CA) . 
These methods have been described in detail ( Infect. Immun. 
60:2174-2181 (1992)). 

DNA sequencing . DNA sequencing of the clones derived from 
E8775 was performed using the Model 373A DNA sequencing system from 
Applied Biosystems, Inc, Foster City, CA. Reactions were performed 
using the dideoxy method with fluorescent dye-labeled terminators, 
double-stranded templates, oligonucleotide primers, and AmpliTaq 
DNA polymerase following the manufacturer's protocol. Appropriate 
oligonucleotide primers were synthesized using a Model 391 DNA 
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Synthesizer (Applied Biosystems, Inc, Foster City, CA) . Plasmids 
were purified for use as templates by a slight modification of the 
alkaline lysis method and cesium chloride density gradient 
centrifugation described by Maniatis ( Molecular Cloning; A 
Laboratory Manual , Cold Spring Harbor Laboratory, Cold Spring 
Harbor (1982)). Plasmids were recovered by dialysis followed by 
multiple ethanol precipitations to remove residual salt. Sequence 
analysis was performed using software developed by the University 
of Wisconsin Genetics Computer Group ( Nucleic Acids Res. 12:387-395 
(1984)). 
RESULTS 

CS6 genes cloned from ETEC strain E8775 into pUC19 . A stable 
clone named M233 was obtained from a partial digest of the 61- 
megadalton plasmid from E. coli M56. It was a spontaneous deletion 
of a larger clone. The insert in M233 was approximately 24 kb. 
Subclones were obtained by digestion with various enzymes and a 
subclone containing 4.9 kb from the Hin dlll site to Kpn l was found 
that expressed CS6. This clone was designated M285. Expression of 
CS6 was verified by transferring plasmids into E. coli HB101 and 
detecting CS6 in heat, saline extracts. The cloned CS6 is 
expressed under the same conditions as CS6 from the native 61- 
megadalton plasmid (Table 1) . CS6 was detected in western blots of 
heat, saline extracts of bacteria grown on CFA, L agar or MacConkey 
agar. CS6 was not expressed on bacteria grown at 17 °C. 
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Tabl 

Strain: 

Plasmid: 

Chromosome: 

Media 
CFA 37 °C 
CFA 17 °C 
L agar 
MacConkey 



1. R gulation of 

M287 M56 
pM285 native 
HB101 HB101 



+ + 

+ + 

+ + 



CS6 Expression 

E8775 HB101 
native none 
native HB101 



+ 

+ 
+ 



N-terminal sequen ce of CS6. The protein sequence of the N- 
terminus of CS6 was determined from strains E8775 and from M233, 
the large clone derived from E8775. The 16 kDa proteins recovered 
from heat, saline extracts, and ammonium sulfate precipitation of 
M233 yielded two amino acids at each position (except cycle 12) 
indicating that two proteins were present. From the strength of 
the two signals, a probable primary sequence and a probable 
secondary sequence call was made for each of fifteen cycles. 
Quantitative analysis of the peak areas indicated that the molar 
ratio of the primary sequence (CS6A) to secondary sequence (CS6B) 
was approximately 3:1. The presence of the same two proteins was 
evident from strain E8775 grown on CFA agar and on L agar. 

DNA sequence of CS6 operons . The sequences of DNA cloned from 
E8775 (in M285) were determined. They are available from Genebank 
accession number U0484 6. The DNA sequence, when compared with 
sequences from another strain, were found to diverge abruptly on 
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both sides of the common area. Four open reading frames were 
detected. These were designated css A, css B, esse, and css D for CS 
six. 

The GC content of the DNA is 34% and the codon usage is in the 
range found for Escherichia coli genes that are expressed at low or 
very low levels as defined by Osawa et al ( Prokarvot ic Genetic 
Code. Experentia 46:1097-1106 (1990)). 

Genes encoding CS6 structural genes . The four open reading 
frames are preceded by consensus sequences for binding RNA 
polymerase and ribosomes. DNA and deduced amino acid sequence of 
cssA, a CS6 structural protein. The DNA sequence of the entire 
operon is available from Genebank accession number U04844. The 
deduced amino acid sequence from E8775 is given. The arrow 
indicates the site of cleavage of the signal peptide. The protein 
sequence is associated with the sequence for the second construct: 
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"35 -10 RBS 

TTGACACATTACGAATGTTATGTATACAATAAAAATGATTATAGCAATATTAATGGTGTTAT 

ATGAAGAAAACAATTGGTTTAATTCTAATTCTTGCTTCATTCGGCAGCCATGCCAGAACA 
MKKTIGLILILASFGSHART 2 

t 

GAAATAGCGACTAAAAACTTCCCAGTATCAACGACTATTTCAAAAAGTTTTTTTGCACCT 
EIATKNFPVSTTISKSFFAP 22 

GAACCACGAATACAGCCTTCTTTTGGTGAAAATGTTGGAAAGGAAGGAGCTTTATTATTT 
EPR IQPS FGENVGKEGALLF42 

AGTGTGAACTTAACTGTTCCTGAAAATGTATCCCAGGTAACGGTCTACCCTGTTTATGAT 
S VN LTV P ENVS Q VTVY PVY D 62 

GAAGATTATGGGTTAGGACGACTAGTAAATACCGCTGATGCTTCCCAATCAATAATCTAC 
EDYGLGRLVNTADASQSI IY82 

CAGATTGTTGATGAGAAAGGGAAAAAAATGTTAAAAGATCATGGTGCAGAGGTTACACCT 
Q IVDEKGKKMLKDHGAEVTP 102 

AATCAACAAATAACTTTTAAAGCGCTGAATTATACTAGCGGGGAAAAAAAAATATCTCCT 
NQQITFKA LNYTSGEKKIS P 122 

GGAATATATAACGATCAGGTTATGGTTGGTTACTATGTAAACTAA (Seq. #4) 
GIYNDQVMVGYYVN* (Seq. #5) 136 

The first open reading frame, css A was identified as the gene for 
the CS6 structural protein CS6A designated as the primary protein 
based on the amino acid N-terminal sequence. css A includes a 
signal sequence that is typical for many exported proteins. The 
deduced CssA protein from E8775 has 136 residues, as shown above 
and in Table 2. The molecular weight agrees with that previously 
determined from SDS PAGE. No homologous proteins were found by 
searching the protein databases, but conserved residues are present 
near the C-terminus and this is typical of f imbrial subunits that 
are carried across the periplasm by chaperons. 
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Tabl 2» Characteristics of Prot ins D due d from CS6 Operons 

Number of Molecular Isoelectric 



Protein Source Residues Weight Point 

CssA (CS6A) E8775 136 15,058 5.27 

CssB (CS6B) E8775 146 15,877 4.40 

CssC E8775 212 24,551 10.24 

CssD E8775 802 90,393 9.97 



cssB begins 17 bases downstream from css A, There is a typical 
signal sequence. cssB was identified as the gene for the CS6 
structural protein CS6B designated as the secondary protein based 
on the amino acid N-terminal sequence. The C-terminus matches the 
consensus typical of fimbrial subunits. The sequence from E8775 is 
given. The arrow indicates the site of cleavage of the signal 
peptide. 

MLKKIISAIA LIAGTSGWN A GNWQYKSLDV NVNIEQNFIP DIDSAVRIIP 3 0 

t 

VNYDSDPKLD SQLYTVEMTI PAGVSAVKIA PTDSLTSSGQ QIGKLVNVNN 80 
PDQNMNYYIR KDSGAGNFMA GQKGSFPVKE NTSYTFSAIY TGGEYPNSGY 13 0 
SSGTYAGNLT VSFYSN 146 (Seq. #6) 

A region of dyad symmetry is present 6 bases downstream from 
CssB in both clones. The sequence is GGCCGCA TTAT TGCGGCC (Seq. #2) 
in E8775 ETEC. Underlined bases form the GC rich stem. 

Genes with homology to fimbrial accessory proteins . cssC 
begins 48 bases downstream from css B. It has a typical signal 
sequence. The deduced proteins from both clones have 212 residues 
with 7 differences. A search of protein databases indicated CssC 
is homologous to chaperone proteins necessary for expression of 
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Pap, CS3, K88, K99, CS31A, S, and Type 1 fimbriae of E. coli and 
SEF14 of Salmonella enteritidis . Fl and pH6 antigen of Yersinia 
pestis , Type 3 of Klebsiella pneumoniae r Type b of Haemophilus 
influenzae, and filamentous heamagglutinin of Bordetella pertussis . 
The structure of PapD, the chaperone protein for Pap fimbriae, has 
been solved by X-ray crystallography and regions important for, 
conserving the structural domains have been identified. CssC 
conforms to the following consensus. Below is the deduced amino 
acid sequence of cssC. The * indicates conservative amino acid 
replacements. Dots are gaps necessary for aligning all sequences. 
Boxes indicate beta strands as defined for PapD. The designation 
of the beta strands for domain 1 (Al through Gl) and domain 2 (A2- 
G2) are given below each box. 



* 

NNF 



* *R*** 

EINKTRVIYS 



Al 



DSTP 


* * 
SVQISNN 


KAYP. . 


*** ** * 
LIIQSNVWDES 


NNKNH. .D 


****p***** 
FIATPPIFKM 




Bl 




CI 




Dl 



* 

ESES 



* **** 
RNIIKIIK 



** E 
TTI . . NLPDSQE 



** ** * * 
SMRWLCIESM 



P* * * * 

PPIEKST. . KINRKEGRTDSINISI 



110 



El 



Fl 



*K**** 
RGCIKLIYR 


p* * 

PASVPSPVFNN . IVEK 


1 

* 

LKWHK 


1 

* 

NGKY 


1 

* * N 
LVLKN 


1 

*p** 
NTPYY 


1 

*** * 
ISFSEVF 


Gl 




A2 




B2 




C2 



160 



* 

FDSDKV. . NNAKD 


* * 
ILYVK 


P 

PY 


* * 
SEKKID 


* * 
ISN. .RIIKKI 


* 

KWAMI 


*D G* 
DDAGAKT 


l 

KLYESIL| 




D2 




E2 




F2 




G2 



(Seq. #7) 
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CssD begins 14 bases upstream of the end of cssC. When 
compared with a second sequence there are 28 differences between 
CssD from E8775 and the other sequence. The deduced protein from 
cssD is homologous to molecular ushers found in operons of Pap, 
CS3, K88, K99 and Type 1 fimbriae of E. coli and SEF14 of Salmonel- 
la enteritidis, Fl of Yersinia pestis . and Type 3 of Klebsiella 
pneumoniae. Overall, CssD and the other proteins are only around 
30% identical and around 50% similar. Asterisks above the CssD 
sequences indicates amino acids conserved relative to molecular 
ushers . 

***** * ** * ** 

MNQFYKKSHYSIQKHQITGLLFLLFIYPFSTSYGNEQFSFDSRFLPSGYN 50 

* ** G Y *** *N * * * C** * * 

YSLNSNLPPEGEYLVDIYINKIKKESAIIPFYIKGNKLVPCLSKEKISSL 100 

G* * * *c** ** ********** * * 

GININNNDNTECVETSKAGISNISFEFSSLRLFIAVPKNLLSEIDKISSK 150 

* G* ********* ******* w** * 

DIDNGIHALFFNYQVNTRLANNKNRYDYISVSPNINYFSWRLRNLFEFNQ 200 

****** * * *g* *** *** * g* * ** 

NNDEKTWERNYTYLEKSFYDKKLNLWGESYTNSNVYNNYSFTGISVSTD 250 

M * * * **a * * * * * v**G*F** 

TDMYTPSEIDYTPEIHGVADSDSQIIVRQGNTIIINESVPAGPFSFPITN 300 

* ****** ** **** ****** 
LMYTGGQLNVEITDIYGNKKQYTVNNSSLPVMRKAGLMVYNFISGKLTKK 350 

* * *G ***** * ** * g** *G G* 

NSEDGDFFTQGDINYGTHYNSTLFGGYQFSKNYFNLSTGIGTDLGFSGAW 4 00 

**** * *** ***** ** *** 

LLHVSRSNFKNKNGYNINLQQNTQLRPFNAGVNFDYAYRKKRYVELSDIG 450 

** * **** * * * * y* 
WHGNLYNQLKNSFSLSLSKS LNKYGNFSLDYNKMKYWDNAYDSNSMSIRY 500 

* ***p* * * ** \ 

FFKFMRAMITTNCSLNKYQSYEKKDKRFSINISLPLTKDYGHISSNYSFS 550 
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* ******** 
NANTGTATSSVGLNGSFFNDARLNWNIQQNRTTRNNGYTDNTSYIATSYA 600 

***** * G *** ** * * * *** 

SPYGVFTGSYSGSNKYSSQFYSASGGIVLHSDGVAFTQKAGDTSALVRID 650 

* *** G ** ****** ** * 
NISDIKIGNTPGVYTGYNGFALIPHLQPFKKNTILINDKGIPDGITLANI 700 

******** * * *** ** **** * 

KKQVI PS RGA I VKVKF DAKKGND I LFKLTTKDGKTPPLGAI AHEKNGKQI 750 

**** * * * * * c * 

NTGIVDDDGMLYMSGLSGTGIINVTWNGKVCSFPFSEKDISSKQLSWNK 800 

C 

QC 802 (Seq. #8) 

But comparison with the protein from another strain, the sequence 
data shows the proteins have areas of high homology dispersed 
throughout, especially the first 410 residues, CssD has 4 
cysteines (residues 91, 112, and two near the C-terminus) which are 
conserved in all ushers. 

A region of dyad symmetry is present 347 base pairs into the 
CssD gene in both clones. The calculated free energy value of 
these structures is -7.2 kcal. 

DNA flanking th e CS6 genes . When compared with another 
strain, the DNA sequences of the two clones diverge immediately 
ldownstream of css D and 96 bases upstream of cssA. The non- 
homologous flanking regions have homology with five distinct 
insertion sequences. The homologies include 3% to 32% of each 
insertion sequence but not entire insertion sequences. The 
homology of and Iso-ISl in E8775 continues beyond the clones we 
have sequenced and may be a complete insertion sequences in the 
native plasmids. 
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It should be noted that minor variation in bases of the 
peptides does not destroy antigenicity. A protein having at least 
60% homology with the CS6 A and B proteins identified herein having 
conservative substitution would be expected to have desirable 
properties. 

As indicated previously, bacteria transformed with plasmids 
which express the CS6-A and CS6-B proteins may be administered by 
mouth. If the transformed bacteria are attenuated strains, they 
may be delivered live. It is also possible to administer killed 
bacteria. Carbonated beverages such as carbonated water are 
particularly useful as carriers which are inexpensive. When the 
bacteria are administered in a carrier wherein the pH is not over 
7, an antacid may be given with the bacteria. 

The CS6 A and CS6 B proteins may also be at least partially 
purified and administered by mouth by means usually used in the art 
to deliver antigens to the intestinal tract, including in protected 
forms such as liposomes, microcrystals, microdroplets , as microen- 
capsulated formulations or as enterically coated capsules. 
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